Minimizing volatility increases large risks 
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Abstract : We introduce a faithful representation of the heavy tail multivariate 
distribution of asset returns, as parsimonous as the Gaussian framework. Using 
calculation techniques of functional integration and Feynman diagrams borrowed from 
particle physics, we characterize precisely, through its cumulants of high order, the 
distribution of wealth variations of a portfolio composed of an arbitrary mixture of 
assets. The portfolio which minimizes the variance, i.e. the relatively "small" risks, 
often increases larger risks as measured by higher normalized cumulants and by the 
Value-at-risk. 



Finance is all about risks and risk is usually quantified by the volatility. As is now 
well recognized, due to the presence of heavy tails and long-range correlations, the 
volatility is only an imperfect measure of risk. In principle, the risk associated with 
a given portfolio is fully embedded in the multivariate distribution of the returns of 
these assets. Practically, dealing with this multivariate distribution is a formidable 
task for both its specification (important for scenario simulations), for portfolio opti- 
mization and for the control of risks. Until now, simpler one-dimensional measures of 
risks have been developed, for instance in terms of the Value-at-risk. However, they 
suffer from their reliance on a stable and accurate determination of the covariance 
matrix of returns, which is problematic in the presence of heavy tails and of time- 
varying volatilities and correlations. A variety of methods have been also proposed 
that are however all limited in their domain of application. 

Here, we focus our attention upon the "fat tail" problem, having in mind that 
a large part of the time-varying volatilities and correlations may result from their 
unstable determination precisely due to the presence of non-gaussian effects. Gener- 
alization of our "fractal" covariance matrix approach described below in the spirit of 
GARCH models is straightforward and will be described elsewhere. 

To address the "fat tail" problem, we present three important innovations. First, 
we develop a new method that provides an approximate but faithful representation of 
the full multivariable "fat tail" distribution of asset returns. Second, we adapt theo- 
retical tools from theoretical physics to calculate precisely the distribution of returns 
of the full portfolio. Third, we compare different portfolio optimization procedures 
and show that minimizing the variance is not optimal as it may often increase large 
risks. We provide the relevant tools for better optimization suitable to a given risk 
aversion. 

1 "Fractal" Covariance Approximation 

Consider two heterogeneous assets, such as the US index SP500 and the Swiss Franc 
(CHF), both quoted in US dollars. The empirical joint bivariate distribution of their 
dayly annualized returns 



is plotted in Fig.l for the time interval from Jan. 1971 to Oct. 1998. Si(t) is the 
price at time t valued in US dollars, where % = 1 for the SP500 and % = 2 for the 
CHF. The contour lines define the probability confidence level : 95% of the events 
fall inside the domain limited by the outer line. Thus, there is a 5% probability to 
observe events falling outside. The other confidence levels of 90%, 50% and 10% are 
similarly defined. Fig. 1 also shows the marginal distributions for the SP500 and the 
CHF in US$. The abcissa axis are the same as for the bivariate representation so 
that the projection from the bivariate to the monovariate distributions is highlighted. 
The ordinate of the marginal distributions uses a logarithmic scale : a linear plot then 
qualifies an exponential distribution. One can observe that, while the distributions 
are not far from an exponential, they exhibit a slightly upward curvature in the tails 
indicating a slightly more heavy tail than the exponential. 





1.1 Contracting maps as a new quantification of departure 
from Gaussian 



Let us call Fi(r) and i 7 ^?") their cumulative marginal distributions, giving the prob- 
ability that the return be less than r. Let us introduce the transformation r*i — > y\ 
and r 2 — > i/2 which transforms Fi(ri) and ^2(^2) into Gaussian distributions with 
unit variance. By the conservation of probabilities, this reads 



(2) 



where erf(y) is the error function. We can rewrite it in order to make explicit the 
nonlinear transformation from the r variables to the y variables : 

3/1,2(7-1,2) = v^err 1 (2F 1)2 (r 1)2 ) - 1) (3) 

where erf -1 is the inverse of the error function. The Gaussian y variables, together 
with the nonlinear transformation embody fully and with no approximation the 
heavy tail nature of the marginal distributions. 

As an analytic illustration, consider stretched exponential (Weibull) distributions 
Q of the form 

P(r) = — — — ;= r 2 e 'V . (4) 

This function provides a reasonable fit to the distributions F\{r\) and Fi{r<i), espe- 
cially in the tails as shown in Fig. 1, with A\ = 4500, c\ = 0.7, roi = 0.79 and 
A2 = 700, c 2 = 1.1, tq2 = 2.13. It is clear that the tails are much "fatter" than for a 
Gaussian. 

In this case, the change of variable ([3D can be written, using a slight change of 
normalization, as 

yi (t) =sign(ri(t)) |ri(t)|t . (5) 

The distribution of j/j is Gaussian with a variance now equal to Va = (rj ) Q . 

We stress that the transformation (§) is exact and valid for any distribution. It 
will be used in the simulations. In contrast, expression (|5|) is exact only for Weibull 
distributions. It is found to provide a good approximation of the tails of the return 
distributions. It is used below to present the novel theoretical approach. 

Fig. 2 shows yi as a function of r\ and 2/2 as a function of r2 from the data 
and the comparison with (^) using the same parameters as above and shown in 
Fig. 1. The negative returns have been folded back to the positive quadrant. The fits 
with expression (|5p shown in Fig. 2 are good for the large values, while deviations 
for small returns indicate that the departure from a Gaussian is less strong in the 
center of the distributions. This plot provides a novel quantification of the departure 
from a Gaussian. The downward curvatures result from the fact that the tails of 
the distribution are "fatter" than a Gaussian: the r —>■ y transformation is thus a 
contracting map. 



1.2 Optimal multivariate distributions for "fat tails" 



To put our next step into its relevant context, we recall that multivariate Gaus- 
sian distributions have played and still play a key role, not only because they are 
convenient to use, but also because they are optimal in an information theoretical 
sense : with the only prior information of the covariance matrix, they contain the 
least possible assumptions, in other words they are the most likely representation 
of the data. As already pointed out, they are however inconsistent with the pres- 
ence of heavy tails and non-normal dependence. In this light, we can now capitalize 
upon the transformation and include the information on the heavy tails to better 
characterize the multivariate asset return distributions. In this goal, the dependence 
between the assets is characterized by the covariance matrix V of the transformed 
Gaussian variables y's : 

V = (Y Y T ) - (Y) (Y T ) , (6) 

where (Y) denotes the expectation of Y and Y is the unicolumn matrix with elements 
Ui and y2- This generalizes straightforwardly for a larger number N of assets. For 
stretched exponential variables for which the relation ([5]) holds, the definition (|6|) 
leads to the covariance elements 




Note that the essential information on the sign of the returns is kept while a fractional 
power of their amplitudes is taken, hence the term "fractal" covariance matrix (that 
we keep even for the general case to refer to the contracting nature of the r — > y 
mapping). has a faster convergence rate for sparse data and is better behaved 
statistically than the usual covariance matrix since it is less sensitive to large fluctu- 
ations due to the small power c/2. As a test, we have verified that the normalized 
correlation coefficient py = ^y^y 22 f° r the covariance matrix for the y\ and yi vari- 
ables is significantly more stable than the usual correlation coefficient p v = -j=^== 
for the covariance matrix of the returns r\ and function of time in running; 

windows of various sizes. The introduction of ARCH models and their generalizations 
has been motivated by the observed non-stationarity of the usual covariance matrix 
0. The improved stability of pv suggests that this non-stationarity results in part 
from the inadequacy of the covariance matrix to provide an efficient characterization 
of the asset risk profiles, resulting from the presence of "fat tails" . Our new approach 
directly addresses this problem. 

Conditionned only on the measurement (^) of the "fractal" covariance matrix 
V, the most likely representation of the time series becomes the usual Gaussian 
multivariate distribution in terms of the y variables : 

P(Y) = ( 27 r)-^ 2 iVl-^exp (-§ (Y T - (Y T )) V~ l (Y - <Y») , (8) 

where \V\ is the determinant of V. We stress that this parameterization is funda- 
mentally different from the usual Gaussian approximation on the price returns r. 
To get the implied multivariate distribution -P(R) in terms of the return variables 
R T = {r±, r 2 }, we use the identity P(R) = -P(Y) where ^ is the jacobian of the 



transformation from R — > Y : 

N j F 

P(R) = \V\-^ exp -i (Y T - (Y T )) (V- 1 -I) (Y - (Y») II f 1 ^) , (9) 

j=i ar j 

where V is again the covariance matrix for Y (i.e. the "fractal" covariance matrix 
for R) and / is identity matrix. Changing the normalization as in the change of 
variable (0) leads to the same form (|Sp except for the identity matrix I being changed 
into the diagonal matrix of elements Vu = (roi) Ci . This representation is exact for 
arbitrary uncorrelated variables, in which case V — I. It is also exact for a Gaussian 
distribution modified by monotonic one-dimensional variable transformations for any 
number of variables, or equivalently by multiplication by a non-negative separable 
function. This method has recently been independently introduced in the context of 
multivariate distributions of particle physics data ||. 

Fig. 3 presents the bivariate distribution -P(Y) obtained from Fig. 1 using the 
transformation (|3|) as well as the corresponding Gaussian marginal distributions. The 
contour lines are defined as in Fig. 1. Note their smooth elliptic shape that contrast 
with the diamond shape shown in Fig. 1. The principal axis of the ellipses are almost 
perfectly along the yi,y2 axis, a signature of the weak "fractal" correlation between 
the SP500 and the CHF. In the limit of absence of correlation, the ratio of the small 
over large principal axis is equal to \[y^- 

As a simple and efficient "goodness of fit" test for the reliability of this repre- 
sentation (H), we have studied the fraction of events (points) shown in Fig. 3 within 
an ellipse of equation \ 2 = (Y T — (Y T ))V~ 1 (Y — (Y)) as a function of the \ 2 
density (1/2) e~ x I 2 for two degrees of freedom. We observe a very straight bisector 
line which qualifies the multivariate Gaussian representation (|9|). Varying x 2 horn 
to 1 spans the distribution from the small most probable returns to the large least 
probable returns. 



2 Characterization of portfolios 
2.1 Empirical investigation 

We can now capitalize upon the rather good stationarity properties of the repre- 
sentation of the bivariate distributions provided by and use this information to 
optimize portfolios and characterize risks. Consider a portfolio investing a fixed frac- 
tion p of its wealth W in the SP500 and the remaining fraction 1 — p in the CHF. 
Using the historical time series, we construct numerically the time series W(t) from 
the recursion 

W{t + 1) = pW(t) Sl (t) + (1 - p)W(t)s 2 (t) (10) 

which ensures that p is fixed. The annualized dayly return of W(t) is defined by 
r w{t) = 250 In W -^t t P ■ Fig. 4 shows the dependence as a function of p of the variance 



C 2 = ((r w - (r w )) 2 ) 



(11) 



and of the kurtosis 

Q _ ({r w - (r w )f) 

C? ~ ((r w - {r w ))r d ' [iZ) 

of the dayly portfolio returns. The kurtosis quantifies the deviation from a Gaussian 
distribution and provides a measure for the degree of "fatness" of the tails, i.e. a 
measure of the "large" risks. Taking into account only the variance and the kurtosis 
and neglecting all higher order cumulants, a distribution can be approximated by the 
following expression valid for small kurtosis 

P ( ^exp[- < r ^» 2 (l - g (r »-^» 2 )] . (13) 

The negative sign of the correction proportional to k means that large deviations are 
more probable than extrapolated from the Gaussian approximation. For a typical 
fluctuation \S — (S\) ~ VC2, the relative size of the correction in the exponential is 
||. For the large values of k found below this approximation ( |13"1) break down and 
the deviation from a Gaussian is much more dramatic. 

As seen in Fig. 4, the variance has a well-defined quadratic minimum at py = 
0.375. The kurtosis has a S-shape with two local minima at p K2 = —0.405 (absolute 
minimum) and p R \ = 0.125 (local minimum). The table gives the corresponding 
variance C 2 and kurtosis n for these three portfolios and for the benchmark ps — 0.5. 



p 


c 2 


K 


n 


c 


VaR (20 days) 


VaR (10 years) 


Pb = 0.5 


2.42 


19.9 


1.0 


0.75 


-3.77 


-19.4 


p v = 0.375 


2.28 


9.53 


1.77 


1.09 


-4.41 


-13.6 


p K i = 0.125 


2.85 


4.20 


3.44 


1.73 


-6.12 


-12.4 


p K2 = -0.405 


7.77 


3.92 


4.39 


1.35 


-9.19 


-22.8 



Table: p (resp. 1 — p) is the weight in value invested in the SP500 (resp. CHF). 
C 2 (resp. k) is the variance (resp. kurtosis) of the distribution of returns of the 
portfolios. r£ and c are the scale and exponent of the Weibull fit to their tail. The last 
two columns report the calculated Value-at-Risk at the 95% and 99.96% confidence 
levels. 



The conclusion of this analysis is striking: the portfolio with p K \ = 0.125 has 
a variance only 25% higher than that of the minimum variance portfolio while its 
kurtosis is smaller than half that of the minimum variance portfolio. It is thus 
possible to construct a portfolio which has about the same degree of "small" risks 
(as measured by the variance) while having significantly smaller "large" risks than 
would give the standard "mean- variance" portfolio approach 

This result can also be interpreted in a way that highlights the danger of standard 
practice : minimizing "small" risks as quantified by the variance may increase (here 
more than double) the "large" risks. In trouble times of large volatity fluctuations, 
it is particularly important to recognize this fact. Fig. 5 further exemplifies this 
phenomenon by plotting the cumulative distributions F(rw) for the four portfolios 



in an inverse axis representation, corresponding to the so-called Zipf or rank-ordering 
plot : this representation of the nth largest value as a function of its rank n emphasizes 
the information in the tail of the distribution. We can collapse the tails of the 
distributions of the four portfolios by choosing suitable pairs of parameters c and 
for each portfolio distribution and by plotting {ry//^e) c as a function of Inn: this 
collapse is the signature that all the tails are approximately of the same functional 
form (0) and that we have correctly identified the values of the parameters. The table 
lists the values of c and re that best fit the tail of each portfolio return distribution. 
The portfolio with p = 0.125 provides the best compromise with a low variance and 
a low kurtosis: not surprisingly, the exponent c of its tail is the largest corresponding 
to the faster asymptotic decay (thinnest tail). 



2.2 Theoretical formulation 

We now present briefly how these stylized facts can be rationalized by a systematic 
theory based on the representation (|9|). Up to a very good approximation, it is 
harmless and much simpler to replace the returns rj(t) defined in ([I]) by (si(t + 
1) — Si(t))/si(t) and, over reasonable large time intervals (e.g a year), neglect the 
variation the denominator in comparison to the variation of the numerator 5si(t) = 
Si(t + 1) — Si(t). The dayly wealth variation at time t of a portfolio of N assets reads 

N 

SW(t) = J2p l Ss i (t) , (14) 

i=l 

where pi is again the weight in value of the ith asset in the portfolio. We normalize 
the weights YliLiPi — 1- Our strategy is to express the 5si(t) variables as a func- 
tion of the yi(t) using (|3]) and calculate directly the distribution P(5W) of the dayly 
portfolio wealth variations. We stress that P(5W) embodies completely all possible 
information on risks and in particular embodies the usual volatility and VAR mea- 
sures. We illustrate the procedure for the case of Weibull distributions for which (||) 
reduces to (d) : 

8W(t)=Y,Pi sign(Vi) |yih • (15) 
i=i 

The formal expression for P(5W) is 

N / r \ _ / N 2\ 

P(8W) = CU ( dyA e-s YT ^ lY 5 (5W(t) - ]y>sign( yi ) jy.^ . (1 6 ) 

In order to simplify the notation, we assume that the average price variations are 
zero. It is easy to reintroduce non-zero average returns in the formalism. Taking the 
Fourier transform of (|i~6D, we get 

where Cj = 2/qi. We only show the expression ( |I7f ) for the case where are integers 
and odd such that the "interaction" terms sign(yj) \yi\ c i simplify into yf. Note that 



the case q = 3 corresponding to an exponent c = 2/3 is realistic empirically for the 



SP500 data. Our results below holds for general g's. Expression (|17D bears strong 
ressemblance with quantities that appear in field theories of particle physics and we 
have used the relevant "technology" to evaluate it. 

For q = 1, i.e. c = 2, the change of variable ([S]) is linear, all integrals are gaussian 
which yields the standard result that the distribution P(5W) is Gaussian with a 
variance 

C 2 =p T Vp. (18) 

This retrieves the results covered by the standard Markovitz's theory 0] at the basis 
of the CAPM §. 

Consider now the more general "heavy tail" case of arbitrary q > 1, i.e. c = 
2/q < 2. For uncorrelated assets, V is diagonal and the multiple integral becomes 
the product of one-dimensional integrals. We have shown || that cumulants of P(rw) 
of all orders can be calculated exactly : 

C 2n (q) = J2C(n, qi ) (p 2 v tl )\ (19) 

i 

where C(n,q) is a function of n and q 0. We have (7(1, g) = (2 9 /y / 7r)r(g + 1/2) 
and (7(2, q) = (2 2q / 'y/F) T(2q + 1/2) - (3 2 2<} /n)[T(q + 1/2)] 2 , where T is the Gamma 
function. In this diagonal case, the g^th power of the variance of i/i is equal to the 
variance vu of the ith asset dayly price variation 5si, leading to (Vu) q = va. We 
stress that this expression (|H]) is valid even when q is real and the interaction term is 
cx sign(?/ i )|?/j|' ? and thus applies to arbitrary Weibull exponential distributions. Odd 
cumulants are vanishing due to our restriction to distribution with zero mean. 

It is well-known that, conditionned on mild regularity conditions, the knowledge 
of all cumulants uniquely determines the distribution function P(5W). We have 
thus been able to characterize fully in this case all aspects of risks associated to a 
given portfolio. Recall that the cumulant C2 is the variance of the portfolio wealth 
variation distribution. The normalized fourth cumulant k = l^f is its kurtosis. As 
already mentionned, it is zero for a Gaussian distribution and provides a standard 
measure of departure from Gaussian. Higher order cumulants quantify the deviation 
from a Gaussian further in the tail of the distribution. 

We have also been able to calculate the cumulants for the correlated case. The 
calculation is significantly more involved and uses a systematic Feynman diagram- 
matic procedure [0, || that has been invented in quantum electrodynamics ||. The 
results and corresponding empirical tests will be given in M. 

This completes our brief summary of our complete analytical determination of 
the distribution of the portfolio wealth variation for multivariate correlated fat tail 
multivariate distributions. Our technique can be extended to more general asset 
distributions of the form P(r) = e~^ r \ as long as f(r) — > +00 for \r\ — ► +00 no 
slower than a power law with positive exponent. This condition covers all cases of 
practical interest. 

We now use these analytical results to generalize our empirical finding that min- 
imizing "small" risks as quantified by the variance often increases significantly the 
"large" risks. 



3 Risk quantification 



3.1 Optimal portfolios 

To keep the presentation simple, we consider the uncorrected diagonal case fll9|) . 
Being presented the full spectrum of cumulants that quantify all possible measures 
of risks, we now determined two "optimal" portfolios. 

• The first porfolio Py has the smaller variance. The corresponding asset weights 
are found to be : 

1 . . 

PlVll = P2V22 = ■■■= PnVnn = ^T~r > ( 20 ) 

v if 



where Vu is the variance of the zth asset. The assets contribute to this portfolio 
in value inversely proportional to their variance. 

The second portfolio P# has simultaneously the smallest kurtosis % and small- 

°2 

est higher normalized cumulants \2 m = (cly™ f° r m > 2. The corresponding 
asset weights are : 

1/2 1/2 1/2 1 / n1 x 

PiVn = P2V22 = ...= PnV^n = v 1 • ( 21 ) 

Since the normalized cumulants \2 m with m > 2 measure the deviation from a 
Gaussian in the tail, Pk minimizes the large risks. 



3.2 Small versus large risk optimization 

The asset weights given by (|2lf) do not minimize the portfolio variance but do cor- 
respond to the smallest possible large risks. Reciprocally, the asset weights given 
by (f2"UD, that minimize the portfolio variance, increase the large risks. We state two 
results among several others that we have obtained that generalize this observation. 
Let us denote 

1 

Xi = 



2^7 = 1 1/2 



the relative inverse risk brought by asset i. Let us also call A^j, (resp. Xim) the 
normalized cumulant of order 2m of the portfolio Pk (resp. Py)- Then, 



A w 1 (EiXf 

A 2m 1 V 



4m Nm -\E,Xj- 

1 ^ ^ ltn n litrmrn c .ill /-*v Ar (1/ n 1 c\ 1 -i- r-\ \ ) 



(22) 



We thus find that A^ is always smaller or equal to \ 2n l for m > 2 for all possible 
values of Xj's. The equality occurs only for all X^s being equal to 1/N, i.e. for 
assets with identical variances. This demonstrates that the weights that minimize the 



variance increase the higher normalized cumulants. It also interesting to compare the 
portfolio Pk with the benchmark portfolio Pi/at defined by pi = p 2 = ... = Pn = l/N. 
We find 



ratio = 




Notice that changing all variances vu into their inverse change the ratio of kurtosis 
into its inverse. This implies that, if we find a set of v^s for which the ratio of kurtosis 
is smaller than one, then the set of the inverses 1/va's gives a ratio of kurtosis larger 
than one. This proves that there are many situations for which minimizing the 
variance of the portfolio may either increase its kurtosis and therefore its large risks 
as compared to that of the benchmark. 

3.3 Empirical test 

3.3.1 Kurtosis 

Fig. 6 compares the dependence of the empirical kurtosis shown in Fig. 4 to the 
prediction obtained from Eq. ([19]) of the theory. We use the result for uncorrelated 
assets as the coefficient of correlations are small p v ~ pv ~ 0.03. We have checked 
that taking into account the non-zero value of p does not change significantly the 
results. 

We show six theoretical curves for all the combinations of the values c\ = 0.7, 
Ci = 0.8 and c 2 = 1.05, c 2 = 1.1 and c 2 = 1.15. For relatively large positive ('long') 
and negative ('short') weights p of the SP500, the kurtosis k is mostly sensitive to the 
estimation of the exponent C\ of the SP500 return distribution, because the SP500 
has the fatest tail (smallest exponent c). For small values of p, the reverse is true 
and the portfolio kurtosis is mostly sensitive to the exponent c 2 of the CHF return 
distribution. The empirical determination shown in Fig. 4 is replotted as circles. In 
the domain of p with reasonable variance and kurtosis, we find a quite good agreement 
for ci = 0.75, c 2 = 1.15. The other theoretical curves provide the range of uncertainty 
in the kurtosis estimation coming from measurement errors in the exponents c. The 
main point here is that the theory adequately identifies the set of portfolios which 
have small kurtosis and thus small 'large risks' and still reasonable variance ('small 
risk'). We stress the importance of such precise analytical quantification to increase 
the robustess of risk estimators : historical data becomes notoriously unreliable for 
medium and large risks for lack of suitable statistics. 

3.3.2 Value-at-Risk 

As a final test, we show how the different portfolios perform with respect to the 
Value-at-Risk (VaR) at different confidence levels. Recall that the VaR determines 
the probability of a portfolio of assets losing a certain amount in a given time period 
due to adverse market conditions with a particular level of confidence Cl 0. For 
instance, a VaR- measure of one millon dollars at the Cl = 95% level of confidence 
implies that total portfolio losses would not exceed one million dollars more than 



1 — Cl — 5% of the time (i.e. typically one day in twenty) over a given holding 
period. In essence, VaR provides a measure of extreme events that occur in the lower 
tail of the portfolio's return distribution. 

We have estimated the VaR for each of the four portfolios both from historical 
data and from the stretched exponential model. For each weight, we constructed the 
distribution of returns P(rw) obtained from (|i0|) and estimated directly the VaR such 
that the fraction of negative returns smaller than VaR is 1 — Cl- Mathematically, this 
corresponds to determine the return rw such that F{rw) = 1—Cl- The corresponding 
VaRs at the Cl = 95% level are given in the table. This confidence level corresponds 
to a typical maximum dayly loss encountered once every 20 days. 

An independent estimation was performed by using the fits of the distributions of 
rw by stretched exponentials, with the values of c and reported in the table. That 
the portfolio distributions can still be considered of this form in their tail is validated 
by an "extreme deviation" theorem [IT]]. Then, the VaR is solution of 



V2 



(24) 



which has to be solved with respect to Var. The additional multiplicative factor 
a ~ 10 accounts for the empirical fact that the stretched exponential is valid only 
in the tail of the distribution, a has been calibrated for one confidence level and 
checked to remain approximately the same for the others. 

This calibration allows us to predict the VaR at higher confidence levels, i.e. for 
dayly losses that can typically occur over longer period of times than 20 days. For 
relatively low confidence interval like Cl = 95%, we find that the VaR for the Variance 
portfolio Py p = 0.375 is significantly smaller than that for the kurtosis portfolio Pk 
with p = 0.125. But since the exponent c of Pk is larger than that of Py, the tail of 
Pk is bounded to become thinner and the VaR of the kurtosis portfolio Pk is bound 
to become smaller than that of the variance portfolio Py at high confidence levels. 
We calculate that the cross-over occurs approximately at a confidence level of 99.93% 
corresponding to a typical largest dayly loss of about 12% occurring once every five 
years. For larger time horizon, the kurtosis portfolio becomes better, having a smaller 
VaR. We show the VaR at the confidence level of 99.96% corresponding to the decadal 
dayly shock, i.e. to the typical largest loss seen once every ten years. As expected, 
the kurtosis portfolio has the smallest VaR. 
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FIGURE CAPTIONS 



Fig. 1: Bivariate distribution of the dayly annualized returns of the SP500 US 
index 1 and of the CHF 2 (in US $) for the time interval from Jan. 1971 to Oct. 
1998. Half of the data is represented for clarity of the figure. The contour lines define 
the probability confidence level of 95% (outer line), 90%, 50% and 10%. The upper 
and left diagrams show the projected marginal distributions for the SP500 and the 
CHF in US$ and their fit to (g). The parameters of the fit are Ay = 4500, c x = 0.7, 
r i = 0.79 and A 2 = 700, c 2 = 1.1, r 02 = 2.13. 

Fig. 2: Dependence of the gaussian variables defined by ([3]) as a function of the 
return for the SP500 and CHF data shown in Fig. 1. The negative returns have been 
folded back onto the positive quadrant. The continuous lines are given by (|5]) with 
Ci = 0.7 and c 2 = 1.1 respectively for the SP500 1 and CHF 2. 

Fig. 3: Bivariate distribution -P(Y) obtained from Fig. 1 using the transformation 
(H). The contour lines are defined as in Fig. 1. The upper and left diagrams show the 
corresponding projected marginal distributions, which are gaussian by construction 
of the change of variable (|3|). Both are fitted by the continuous line of equation 
P lj2 = 150 exp(-| 2/li2 | 2 /2). 

Fig. 4: Empirical dependence as a function of p of the variance C 2 and of the 
kurtosis k of the distribution of returns r w (t) = 250 In W J^^ of a portfolio with a 
fraction p (resp. 1-p) in value invested in the SP500 index (resp. in the CHF), whose 
total value is given by (|10|). The variance has a well-defined quadratic minimum at 
Pv = 0.375. The kurtosis has a S-shape with two local minima at p K 2 = —0.405 
(absolute minimum) and p K \ = 0.125 (local minimum). The table gives the corre- 
sponding variance c 2 and kurtosis k for these three portfolios and for the benchmark 
p B = 0.5. 

Fig. 5: Rescaling of the distributions P(rw) of returns rw obtained from the 
four portfolios studied in the table. The rescaling uses for the ordinate the reduced 
variable {rw/^i) c where the exponent c and the characteristic return scale r£ have 
been determined by a direct fit to each portfolio return distributions. The abcissa is 
the rank n of the nth largest value plotted along the ordinate. This rank-ordering 
plot, which is the same as a cumulative plot, but with reversed axis, emphasizes the 
information contained in the tail. The symbols correspond to: + : p = —0.405; o : 
p = 0.125; * : p = 0.375; x : p = 0.5. The straight line has equation 7.50 — 1.16 Inn. 

Fig. 6: Comparison of the empirical kurtosis (circles) shown in Fig. 4 with the 
prediction obtained from Eq. ([T^) of the theory. The six theoretical curves correspond 
to all combinations of pairs of values c\ = 0.7, c\ = 0.9 and c 2 = 1.15 (solid line); 
c 2 = 1.1 (dotted-dashed line) and c 2 = 1.05 (dotted line). 



